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Abstract — For the tree topology, previous studies show the 
maximum likelihood estimate (MLE) of a link/path takes a 
polynomial form with a degree that is one less than the number 
of descendants connected to the link/path. Since then, the main 
concern is focused on searching for methods to solve the high 
degree polynomial without using iterative approximation. An 
explicit estimator based on the Law of Large Numbers has 
been proposed to speed up the estimation. However, the estimate 
obtained from the estimator is not a MLE. When n < oo, 
the estimate may be noticeable different from the MLE. To 
overcome this, an explicit MLE estimator is presented in this 
paper and a comparison between the MLE estimator and the 
explicit estimator proposed previously is presented to unveil the 
insight of the MLE estimator and point out the pitfall of the 
previous one. 

Index Terms — Loss tomography, Tree topology. Transforma- 
tion, Explicit Estimator. 



I. Introduction 

Network tomography is proposed in 1 1 1 to obtain net- 
work characteristics without modifying network infrastructure, 
where the author suggests the use of end-to-end measurement 
and statistical inference together to estimate the characteristics 
instead of direct measurement. The end-to-end measurement 
can be divided into two classes: passive and active, depending 
on whether probe packets are sent from sources to receivers. 
Without probing, the passive methods depends on the data 
collected from log files to estimate network characteristics. 
However, the data collected in the log files are either unrelated 
or poorly related that makes inference hard if not impossible. 
In contrast, the active approach attaches a number of sources 
to some end nodes of a network that send probe packets to 
the receivers attached to the other side of the network, where 
the paths from the sources to the receivers cover the links of 
interest. Since the probes are multicast to the receivers, the 
observations obtained by the receivers are strongly correlated. 
Then, statistical inference is applied on the data collected by 
the receivers to estimate the network characteristics, such as 
link-level loss rates [21, delay distribution |i3, 0, E, 0, Q, 
and loss pattern |8|. In this paper, our focus is on using active 
approach to estimate the loss rate of a path/link. 

Loss rate estimation is also called loss tomography in 
literature, where the main focus is on searching for efficient 
maximum likelihood estimators that can avoid the use of 
iterative procedures to approximate the MLE. To achieve this, 
a deep analysis of the likelihood function and a comprehensive 
study of the likelihood equations obtained from the likelihood 
function are essential. Unfortunately, there are only a few 
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analytical results presented in the literature, ["91 and |TOl are 
two of the few. Both papers show that when the Bernoulli 
loss model is assumed for the loss process of a link and 
independent identical distributed ii.i.d) probing is used in end- 
to-end measurement, the maximum likelihood equation of the 
pass/loss rate of a path/link takes a polynomial form. The 
difference between them is that f9| is for the pass rate of 
a path connecting the source to an internal node, |[Tol is 
for the loss rate of a link connecting two nodes that form 
a parent and child pair We call them path-based estimator 
and link-based estimator, respectively. Both estimators target 
the tree topology, and their advantages and disadvantages are 
presented in fT\\. Apart from agreeing on the polynomial 
form, both report that the degree of the polynomial is one 
less than the number of descendants connected to the path/link 
being estimated. Then, how to solve a high degree polynomial 
becomes the critical issue since there is no analytical solution 
for a polynomial that is 5 degree or greater from Galois theory. 
Unfortunately, there has been little progress in this regard until 
[12], where a connection between observations and the degree 
of the polynomial is established that provides the theoretical 
foundation to reduce the degree of the polynomial obtained 
from the likelihood equation. Prior to |fT2). the authors of 
1 13 1 introduced an explicit estimator built on the law of large 
numbers. The estimator has been proved to be a consistent 
estimator and has the same asymptotic variances as that of an 
MLE to first order When n < oo, the estimate obtained by 
the estimator can be very different from the MLE. Considering 
the cost of probing and dynamic nature of network traffic, we 
argue here that despite the importance of the large sample 
theory in statistics, it is unwise to use an estimator that is 
purely based on the law of large numbers in practice because 
the accuracy of the estimator depends on a large number of 
samples that can take a long time to collect and cost a lot of 
resources. 

The question then becomes whether there is an explicit 
maximum likelihood estimator for multicast loss tomography. 
If so, what is that and how different between the estimates 
obtained by the explicit MLE estimator and the estimator 
presented in |13| when n < oo. The two issues will be 
addressed in this paper. Firstly, we present an explicit MLE 
estimator for the tree topology under the Bernoulli model that 
has a similar computation complexity as the one presented 
in ifTSll . Secondly, a comparison between the two estimators 
is presented that shows the newly proposed estimator is better 
than the previous one when n < oo. The new estimator is also 
better than the previous one in terms of the rate of convergence 
when n ^ oo since the MLE is asymptotic efficient and the 
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best asymptotically normal estimate. 

By expanding both the statistical model used in the likeli- 
hood equation and the observations obtained from receivers, 
we found that the accuracy of an estimator is related to 
the consideration of the correlated observations; while the 
efficiency of an estimator is inversely related to the degree 
of the likelihood equation that is proportional to the number 
of correlations. Then, how to keep the accuracy without losing 
efficiency becomes the key issue that has been under investi- 
gation for some time. As a result, the connection between the 
degree of the polynomial and the observations obtained by 
receivers is established that sets up the foundation to have 
an explicit maximum likelihood estimator. Meanwhile, the 
exactly cause of the larger variance created by the explicit 
estimator presented in f\3l is identified in the paper. 

The rest of the paper is organized as follows. In Section 2, 
we introduce the notations used in the paper. In addition to 
the notation, the set of sufficient statistics used in this paper is 
introduced in this section. We then present the explicit MLE 
in Section 3 that unveils the connection between sufficient 
statistics and the likelihood model used to describe the loss 
process of a path/link. Section 4 is devoted to compare and 
analyze the explicit MLE with the estimator presented in |fT3l . 
The last section is devoted to concluding remark. 

II. Problem Formulation 

A. Notation 

In order to make correlated observations at the receivers, 
multicast is used to send probes to receivers, where the 
multicast tree or subtree used to connect the source to receivers 
is slightly different from an ordinary one at its root, that has 
only a single child. Let T = (V, E) donate the multicast 
tree, where V — {vo,vi, ...Vm} is a set of nodes representing 
routers and switches of a network; E = {ei, ...,em} is a set 
of directed links connecting node f{i) to node i, where f{i) is 
the parent of node i. To distinguish a link from another, each 
link is assigned a unique number from 1 to m; similarly, each 
node also has a unique number from to m, where link i is 
used to connect node f{i) to node i. The numbers are assigned 
to the nodes from small to big along the tree structure from 
top to bottom and left to right. The source is attached to node 
to send probes to the receivers attached to the leaf nodes of 
T. R is used to denote all receivers. Let A = {^i, ...,Am} 
be an m-element vector, where Ai,i G {1, ■■,m}, is the pass 
rate of the path connecting node to node i. In addition, 
except leaf nodes each node has a number of children, where 
di denotes the children of node i and \di\ denotes the number 
of children of node i. Note that a multicast subtree is different 
from an ordinary subtree, where multicast subtree i, T{i), is 
rooted at node f{i) that has link i as its root link. The group 
of receivers attached to T{i) is denoted by R{i). If n probes 
are dispatched from the source, each probe i = 1, n gives 
rise of an independent realization X^'^ of the probe process 
X, XI — l,k G R if probe i reaches receiver k; otherwise 
Xl = 0. The observations of 17 = (x'*')*^''^' -'"^ comprise 
the data for inference. 

Given observation X^^\ let 



= V ^fc' je{i,-,"}. (1) 

keR{i) 

be the observation obtained by R{i) for probe j. If Y/ — 1 
probe j reaches at least one receiver attached to T{i), that also 
implies the probe reaches node i. Then, 

n 

is the number of probes confirmed from observations that 
reach node i. ni{l), i G V\0 have been proved to be a set of 
minimal sufficient statistics in lfT2l . 

In addition to ni(l),i S 1^ \ 0, we also introduced 
another set of numbers for each node, k, where ny (l) = 
^ Yj^),i,j e dk is for the number of probes 
confirmed from observations that reach at least one receiver 
of R{i) and one of R{j) simultaneously; and 71^^.(1) = 

€ dk is for the number of probes 
confirmed from observations that reach simultaneously to at 
least one receiver in each of R{i), R{j) and i?(fc); • • •; and 
nc{l) = J2Z=i(/\jeG^j^) probes observed by at 

least one receiver in each subtree rooted at node k. We did not 
realize that this set of numbers is a set of sufficient statistics 
until recently, we then name them as the set of alternative 
sufficient statistics. The following theorem confirms this: 

Theorem 1. The alternative set of statistics defined above 
is a set of sufficient statistics. 

Proof: As stated, i G V\Q has been proved to be a 

set of minimal sufficient statistics. Then, if there is a function 
r that can map the alternative set to ni(l),i e \ 0, the 
alternative set is a set of sufficient statistics. The function F 
is as follows 

j.kedi 

n 

The function is a recursive function from bottom up along the 
tree topology. ■ 

III. The Explicit Maximum Likelihood Estimator 

A. Explicit Estimator 

Among the few studies providing analytical results. Mul- 
ticast Inference of Network Characters (MINC) is the most 
influence one that covers almost all of the areas in network 
tomography, including link-level loss, link delay and topology 
tomographies. In loss tomography, it uses a Bernoulli model 
to describe the losses occurred on a link. Using this model, 
the authors of [9] derive an MLE for the pass rate of a 
path connecting the source to an internal node. The MLE is 
expressed in a set of polynomials, one for a path ||9l, lfT4ll . 
IIT5I . Once knowing the pass rates to two nodes that form a 
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parent and child pair, the loss rate of the hnk connecting the 
two node can be calculated by 1 AijAj^^-^. Considering the 
complexity of using numeric method to solve higher degree 
polynomials (> 5), the authors of ifTSl propose an explicit 
estimator on the basis of the law of large numbers, where the 

■ ' and Bk = P{Zk = 1). 



authors define Z 



ii) 



The key of |13 | is based on the following theorem. 
Theorem 2. 1) For K e V\Ii 



Afc = $(Bfc,7) := 



(3) 



2) Define Ak = Jk for k ^ R, and Ak = $(i?fe,7) 
otherwise. Then Ak is a consistent estimator of Ak, and 
hence Ak — Ak/Af(^k) consistent estimator of Ak- 

where Bk, the empirical probability of Bk, is equal to 
X]"=i ^i^^ ■ Note that the consistent property proved only 
ensures when n ^ oo, Ak is almost surely approach to Ak, 
the true pass rate of the path from the source to node k. 
This property is the basic requirement for an estimator If an 
estimator cannot ensure consistency, it should not be called an 
estimator. Then, the main concern with the explicit estimator 
is its accuracy in comparison with the MLE when n < oo sine 
it only uses a part of all available information. 

B. Insight of MLE 

A minimum variance and unbiased estimator (MVUE) is 
normally regarded as a good estimator in statistics. A maxi- 
mum likelihood estimator is a MVUE if it meets some simple 
regularity conditions. Unfortunately, the estimator proposed in 
ifTJl is not a MLE. Apart from that, there are a number of 
concerns with the applicability of the estimator in practice 
because the accuracy of the estimate requires n — > oo. Then, 
the scalability of the estimator must be addressed, where 
the time and resources spent on measurement, time spent 
on processing the data collected from measurement, and the 
stationary period of network traffic must be considered in 
practice. 

To remedy the scalability of the explicit estimator, we 
start to search for an explicit maximum likelihood estimator 
and have a close look at the maximum likelihood estimator 
proposed in ||9l, which is as follows: 



H{Ak,k) = 1 



Ak [^^ Ak' 



(4) 



jedk 



where 7^ , i e F \ is the pass rate of the multicast tree with 
its root link connecting node to node i. Rewriting ^ as 



1 - 



Ak 



(5) 



we found two interesting features of the polynomial; one is the 
expandable feature, the other is merge-able feature. The former 
allows us to expand both sides of (|5]l to have a number of terms 
on each side that are corresponded to each other. The latter, on 
the other hand, allows us to merge a number of terms located 
on the right hand side (RHS) and in the product into a single 



term as the one located on left hand side (LHS) of (|5]l. The 
advantage of the merge-able feature will be detailed in the next 
subsection. We now put our attention on the expanding feature 
to unveil the internal correlation embedded in Q. Expanding 
the RHS of Q and dividing all terms by Ak, we have 



7fc 



E 2^ 

j<k 

j-kedk 



Ak 



+ (-1) 



A 



(6) 



to replace in 



Using the empirical probability jj 
(|6]l, we have a — 1 degree polynomial of Ak- Solving the 
polynomial, the MLE of path i, Ak, is obtained. Note that 7^ 
can be replaced by the alternative sufficient statistics, then the 
LHS of (|6]l becomes 



n,(l) 



^ ^3 

■j<k 
j,k(£dk 



jedk 



Comparing the RHS of (O with ([7|i, one is able to find 
the correspondences between the terms. Each term of ^ 
represents a type of correlation in the model among/between 
the members of the term, while each term of (|7]l is the statistics 
or evidence obtained from an experiment for the corresponding 
term of (|6|. Except the first term of ^ and the first term of 
^ that are exactly equal to each other, other pairs between 
the two can be different from each other if n < 00. Taking 
the first terms of ^ and ^ out, we have 



j,kedk 
j,k<£dk 



-iHjedk 7j 



A 



dk\ 



(8) 



Statistical inference aims to estimate Ak from ([8]l, i.e. match- 
ing the model presented on the LHS to the statistics presented 
on the RHS. This equation also shows that in order to have 
the MLE of a path, one must consider all available infor- 
mation embedded in the observations of R{i), in particular 
the correlations between the descendants. Without correlations 
and/or the corresponding statistics, inference is impossible. 
This corresponds to the data consistent problem raised in |9l 
and |ir|. If we only consider matching a part of (|6]l to the 
corresponding part of ^ when n < 00, the estimate obtained 
would not be the MLE unless the ignored correlations are 
negligible. Then, the explicit estimator proposed in |13 | is not 
an MLE since it only pairs the last term of ^ to the last term 
of©. 

C. The Explicit Maximum Likelihood Estimator 

As stated, the degree of the polynomial is proportional 
to the number of descendants connected to the path being 
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estimated and the estimation relies on the observed correla- 
tions between the descendants to estimate the unknown char- 
acteristic. Under the i.i.d. assumption, the likelihood function 
takes a product form as (|5]). Unfortunately, the previously 
stated merge-able feature has not been given enough attention 
although (|5]i clearly expresses the loss rate of subtree k is 
equal to the product of the loss rates of those sub-multicast 
trees rooted at node k. In probability, ([Sj states such a fact that 
the loss rate of subtree k depends on a number of independent 
events, one for a sub-multicast tree rooted at node k. More 
importantly, this implies that those independent events can be 
merged into a single event, i.e. the LHS of (|5]l. With this in 
mind, whether the degree of (HJl can be reduced depends on 
whether we are able to obtain the empirical pass rate of the 
tree that has a path from the source to node k plus some 
of the multicast subtrees rooted at node k. Let 7^ denote 
the pass rate, where k is for the end node of the path being 
estimated, g denotes the group of subtrees being merged. 
Based on the alternative sufficient statistics of dk, 7fcg, the 
empirical probability of jk , can be computed fT^I. Then, the 
degree of (|5]l can be reduced to 1 that can be solved easily. 
Further, we have the following theorem to calculate the pass 
rate of a path explicitly for the tree topology. 

Theorem 3. For the tree topology that uses the Bernoulli 
model to describe the loss process of a link, there is an explicit 
MLE estimator to estimate the pass rate of the path connecting 
the source to node /c. fc € \ (0 U R), which is as follows: 



7fei7/c2 



where 7^ 



Iki 



f 7fc2 - Ik 
"fei(l) 



and 7fe2 



(9) 

"fc2(l) 



n n n 

nfei(l) and nfc2(l) are the number of probes confirmed from 

observations reaching at least one receiver attached to the 

merged subtree 1 and 2, respectively. 

Proof Since node k is not a leaf node, the subtrees rooted 
at the node can be divided into two exclusive groups: dki and 
dk2 where dki U dk2 = dk and dki n dk2 = 0- The statistics 
of the merged subtrees can be computed by using dki or dk2 
to replace dk in (|7]i. Then, we have 



Ak Ak 



(10) 



Solving ([Tol l, we have 



Ak 



lkllk2 



Ikl + 7fc2 - 7fc 



The theorem shows that using (|7J to merge the alternative 
statistics of multiple multicast subtrees rooted at the same node 
would not affect the estimation of the pass rate of the path that 
ends at the node. 




Fig. 1. A Binaiy Tree 



IV. Comparison of Estimators 

In this section, we tackle the second task set at the be- 
ginning of the paper, i.e. compare the explicit MLE estimator 
against the explicit one presented in |[T3l for n < 00. Two 
scenarios: 2 descendants and 3 descendants are connected 
to the path being estimated, are considered to illustrate the 
estimates received from the explicit estimator proposed in 
1131 drifts away from the MLE as the number of descendants 
increases. To make the variance approximate to the first order 
of the MLE, the explicit estimator needs to send more probes 
to receivers. We will use H{Ai,i) as a reference in the 
following comparison to measure the accuracy between an 
estimator and its MLE counterpart. 

A. Binary Tree 

For a tree with binary descendants as Figure [T] the pass 
rate of link/path i estimated by H{Ai,i) is equal to 



A,, 



7271 



72 + 71 - 7j 
Using the explicit estimator of I.13J . we have 



(11) 



A,, 



7271 

«12(1) 



7271 



n2{l) + ni{l) ~ mil) 



7271 



72 + 71 - 7i 



(12) 



It is the same as the MLE. This is because there is only one 
type of correlation between the model and the observations, 
which are considered by the estimator Thus, Ai — Ai. 

Based on Theorem[3] the estimate obtained by the estimator 
proposed in this paper is also the same as above, which can 
be written as: 



A,. = 



ni(l)n2(l) 



n-(ni(l) + 712(1) 
ni(l)n2(l) 

11? 

ni(l) + n2(l) -«,(!)■ 



(13) 



Thus, for the binary tree, the three estimators produce the 
same result. 
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B. Tertiary Tree 

Let i have three descendants 1, 2, and 3. Based on H{Ai, i), 
we have 



^lili + 72 + 73 - 7i) - 

^i(7i72 + 71 73 + 7273) + 717273 = 0. 



(14) 



Solving the quadratic function, we have the MLE of Ai 
Based on (O, the model and observations are connected by 



E 



^123(1) 



j<k 

j,fce{i,2,3} 



E 



j<k 
j,fce{i,2,3} 



7j7fc _ 717273 
i, i? 



(15) 



It is easy to prove ( fTSl l equals ( fT4] i. Using theorem [3] we 
have the MLE directly, 



A, 



(ni(l) + n2(l)-ni2(l))-n3(l) 



(16) 



n ■ (ni3(l) + "-23(1) - "-123(1)) 

Intuitively, one can notice that (fTSI l considers all correlations 
between the 3 descendants. To prove this equals (fT4l) . we write 
the RHS of the above as 

(ni(l)+n2(l)-ni2(l)) ^ rigW 

n (17) 

("-13(1) + ^^23(1) - ^^123(1)) 



Because of the symmetric nature of the 3 descendants, we 
can also merge descendants 2 and 3 first or merge descendants 
1 and 3 first, that lead to 



and 



A, 



A, = 



(n2(l)+n3(l)-n23(l)) -nill) 
n ■ [nisil) + "12(1) - "123(1)) 

(ni(l) + 713(1) ~ni3(l))-»2(l) 
n ■ ("12(1) + "23(1) - "123(1)) 



respectively. 

In contrast, the explicit estimator presented in (13] has its 
estimate 

717273 \ 2 



A,. = 



"123(1) 



(22) 



Comparing (|22] | with ( fT6b . a direct impression is that 
fails to match the paired correlations, i.e. descendants 1 and 
2, descendants 1 and 3, and descendants 2 and 3. If we assume 
(|22] | is a solution of a quadratic equation, the equation should 
be as follows: 



"123(1) .2 



' "123(1)717273 



+717273 = (23) 



that has a double root. Given the double root assumption, 
is certainly not the polynomial that leads to MLE since it 
is contradict to the Lemma 1 introduced in ||9l and 112 1 that 
states there is one and only one root in (0,1) for the maximum 
likelihood equation. Then, the estimator proposed in ||T3l can 
be regarded as an estimator based on the method of moments 
that will be discussed in the next subsection. 



The denominator of the above is obtained from 



ni(l) + 712(1) -ni2(l) , n3(l) 



l^■ (18) 



According to (|6]l and dTjl, 

7172 ni(l) + ra2(l) - ni2(l) 

71+72 r~ ^ 

A, n 

Using the above in (fTsT l, the denominator turns into 

7172 



(71 + 72 + 73 - 7i) 



A,, 



Similarly, the nominator of ( [TtI i is equal to 

/ , 7172 ^ , "12(1) 

(71 +72 5—) ■ 73 = 71 73 + 7273 73- 

A, n 



(19) 



(20) 



Using (fT9] l and ( l20l i to replace the denominator and nominator 
of (O, we have 



Ai ■ (71 + 72 + 73 - 7i) - 
7172 

= 71 73 + 7273 5—73 

A, 



7172 
A,, 



(21) 



Moving every term to the LHS and multiplying Ai, we have 



C. Analysis 

The fundamental principle of maximum likelihood estimate 
is unveiled clearly by ([8]), where the LHS of ([8]) is the statistical 
distribution of the loss process (called model previously) and 
the RHS is the statistics obtained from observations. There is 
one to one correspondence between the terms cross the equal 
sign. The maximum likelihood estimator aims to solve the 
equation to find the Ak that fits to the statistic model; while 
the explicit estimator proposed previously attempts to use the 
last terms of both sides. When n < 00, the estimate can be 
different from MLE. In fact, if only considering asymptotic 
accuracy, each of the corresponding terms can be connected 
and considered an explicit estimator as the one presented in 
|[T3l . All of the explicit estimators can also be proved to be 
consistent as their predecessor. Then, the theorem 1 can be 
extended as 

Theorem 4. Each of the corresponding pairs of (O forms 
an explicit estimator that is consistent as the one proposed in 
M3L that has the form of 



$i(n„(l),u;,7) A, 



1 



C, 



/rifeej_7fey 

V "..(1) ) 



(24) 



where w corresponds to one of the pairs cross the equal sign of 
and \ w\ < \di \ denotes the number of members in the term, 
and 71^,(1) corresponds to the statistics denoting the number 
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of probes reaching the receivers attached to the descendants 
ofw. 

Proof: It can be proved as the proof of Theorem 2 in 

ma. ■ 

Clearly, when n < oo, the estimates obtained by the explicit 
estimators are not the MLE. Note that each term on the RHS 
of (O is not a sufficient statistic by itself but only a part of the 
sufficient statistics defined in Theorem[T] Combining a number 
of the estimators defined above can improve the accuracy of 
the estimate. When all terms are combined, we have the MLE. 

Despite having an explicit maximum likelihood estimator, 
we still carry out the following analysis to find out why the 
partial matching could not be an MLE. Firstly, we compare 
([8]l with the explicit one proposed in |13| on the basis of 
polynomial. The former is a |c?/j| — 1 degree of polynomial 
that has a unique solution in (0,1), while the latter can be 
considered a polynomial that has a multiple root in (0, 1). In 
other wards, the explicit one considers the sum of the first 
\dk\ — 2 terms on both sides of the equal sign are equal to 
each other, so does the last terms. Instead of using the sum of 
the first \dk \ — 2 terms to estimate Ak, the explicit estimator 
uses the last one to avoid solving a jd^: | — 2 degree polynomial. 
Using this approach to estimate A^, an error is inevitable and 
the amount of the error depends on the number of descendants; 
and the more the better This is because if a node has more 
descendants, we are able to obtain more information about the 
path connecting the source to the node from observations. 

The explicit estimator proposed in |13| can also be viewed 
as a method of moments. If so, its estimate is normally 
superseded by the maximum likelihood, because maximum 
likelihood estimators have higher probability of being close to 
the quantities to be estimated. Also, the estimates obtained by 
the method of moments are not necessarily based on sufficient 
statistics, i.e., they sometimes fail to take into account all or a 
large part of the relevant information in the sample. Therefore, 
the accuracy of such an estimator depends on large sample. 
As stated, ^^^(1) itself is not a sufficient statistic for Ak, 
using nd^{l) alone to estimate Ak fails to consider all other 
correlations between descendants. Then, as stated, error is 
inevitable. Let n be the sample size and d be the error rate, 
their relation is expressed as 

_5_ 

Based on the formula, the explicit estimator relies on sending 
infinite number of probes to reduce the error. Even though, 
the effect of ignoring other correlations remains, that makes 
the variance of the explicit estimator can only approximate to 
the first order of that of the maximum likelihood estimator 

D. Computational Complexity 

The estimator proposed in this paper is the maximum 
likelihood one with a similar computation complexity as that 
of the explicit estimator presented in lfT3l . To determine Ak 
or Ak, both need to calculate the empirical probabilities 
7,;,i G \ 0. The two estimators require to compute and 
rij(l), there are total 0(n ■ — 1)) operations. In addition. 



the estimator proposed here needs to merge descendants into 
two for those nodes that have more than 2 descendants. That 
requires to compute njfc(l), n^^jiiX) for node k, and there 

Idfcl \dk\ 

are 2 x (2 2 1) operations. On the other hand, the 

previous explicit estimator needs to calculate Bk that requires 
the computation of Z\, for each node that takes a smaller 
amount of operations than the MLE does. On the other hand, 
the explicit one needs to perform n-th root operation for each 
node that has more than two children, while the MLE one only 
needs a simple arithmetic operation to estimate Ak- Therefore, 
in terms of operations, the two are similar to each other. 

V. Conclusion 

In this paper, an explicit MLE is proposed that is built on 
the unique features of the likelihood equation and the set of 
alternative sufficient statistics introduced in this paper. The 
two features of the likelihood equation, i.e. expandable and 
merge-able, can be considered micro and a macro views of 
the likelihood equation. The rise of the macro view makes 
merging possible; and the rise of the micro view unveils the 
fundamental of the explicit estimator proposed previously and 
the internal correlations in the model and observations. Based 
on the macro view, a closed form MLE estimator is proposed 
and presented in this paper, which is of the simplest one that 
has even been presented in literature. Applying the micro view 
on Q, we establish the correlations between descendants, and 
we also establish the correspondence between the statistical 
model and the statistics obtained from the leaf nodes of the 
descendants. This correspondence further unveils the connec- 
tion between the observations and the degree of the likelihood 
polynomial. As a result, the explicit MLE is proposed for the 
tree topology. In addition to the explicit estimator, we in this 
paper compare the estimator proposed in this paper with the 
explicit estimator proposed previously, which shows that when 
n < 00, the MLE one is substantially better than the explicit 
one in terms of accuracy. 
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